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Abstract. We prove that any pair of bivariate trinomials has at most 5 isolated roots in the 
positive quadrant. The best previous upper bounds independent of the polynomial degrees counted 
only non-degenerate roots and even then gave much larger bounds, e.g., 248832 via a famous 
general result of Khovanski. Our bound is sharp, allows real exponents, and extends to certain 
systems of rt-variate fewnomials, giving improvements over earlier bounds by a factor exponential 
in the number of monomials. We also derive new sharper bounds on the number of real connected 
components of fewnomial hypersurfaces. 



1. Introduction 

Generalizing Descartes' Rule of Signs to polynomial systems has proven to be a significant chal- 
lenge. Recall that a weak version of this famous classical result asserts that any real univariate 
polynomial with exactly m monomial terms has at most m — 1 positive roots. This bound is sharp 
and generalizes easily to real exponents (cf. section ||). The original statement in Rene Descartes' La 
Geometrie pre-dates 1641. Proofs can be traced back to work of Gauss in 1828 and other authors 
earlier, but a definitive sharp bound for multivariate polynomial systems seems to have elluded us in 
the second millenium. This is particularly unfortunate since sparse polynomial systems now occur 



in applications as diverse as radar imaging [FH95| and chemistry [GH99 



One simple way to generalize the setting of Descartes' Rule to higher dimensions and real expo- 
nents is the following: 

Notation. For any c £ R* := R\ {0} and a = (ai, . . . , a„) £ R", let x a := ij 1 • • • i°" and call 
cx a a monomial term. We will refer to R" := {x € R™ | Xi > for all i} as the positive 
orthant, or quadrant or octant when n is respectively 2 or 3. Henceforth, we will assume 
that F := (/i,... , fk) where, for all i, fi £ M.[x a \ a £ R n ] and fi has exactly mi monomial 
terms. We call fi an n-variate mi-nomial|^ and, when mi,... , > 1, we call F a k x n 
fewnomial system^ (over M) of type (mi,... , mjt). Finally, we say a real root £ of F is 
isolated (resp. non-degeneratej iff the only arcf\ of real roots of F containing £ is £ itself 
(resp. the Jacobian of F, evaluated at £, has full rank). M 

Generalized Kushnirenko's Conjecture (GKC). Suppose F is an n x n fewnomial system of 
type (mi, . . . ,m n ). Then the maximum number of non-degenerate roots of F in the positive orthant 

is nr=i( TO * - !)■ ■ 

Remark 1. The polynomial system ^Jl"=i 1 ( x i ~ i), ■ ■ ■ > n^Li" 1 ^" — *)J easily shows that the 
conjectured maximum can at least be attained (if not exceeded), and integral exponents and coef- 
ficients suffice for this to happen, o 
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1 Quite naturally, we will also call 2-nomials binomials and 3-nomials t rinomi als. 
We use this terminology solely for succinctness. Fewnomial theory |Kho9lt is an important related body of 
work regarding a special class of functions which includes our m-nomials. 

3 i.e., point or homeomorphic image of the unit circle or (open, closed, or half-open) unit interval... 



2 



TIEN-YIEN LI, J. MAURICE ROJAS, AND XIAOSHEN WANG 



We can then succinctly state the original Kushnirenko's Conjecture (formulated in the mid-1970's 
by Anatoly G. Kushnirenko) as the special case of GKC where all the exponents of F are non-negative 
integers. Curiously, Kushnirenko's Conjecture was open for nearly three decades until Bertrand Haas 
found a counter-example in the case (n, mi, TO2) = (2,3,3) (see remark |J below and [HaaOOj). So 



we will derive a correct and sharp extension of Descartes' bound to this case, as well as certain 
additional cases with n>3, m„>4, and degeneracies allowed. Interestingly, the introduction of real 
exponents and degeneracies gives us more flexibility than trouble: The proof of our first main result 
uses little more than exponential coordinates and Rolle's Theorem from calculus. 

Definition 1. For any mi,... ,m n 6 N, let Af(mi , . . . ,m n ) (resp. Af' (mi,... ,m n )) denote the 
maximal number of isolated (resp. non-degenerate) roots an nxn fewnomial system of type (mi, . . . , m n 
can have in the positive orthant. ■ 

Theorem 1. For all to>3 we have Af(3, to) < 2 m — 2 and, in particular, Af(3,3) — 5, 7V(3,4)<14, 
and W(3,5)<30. Furthermore, A/ 7 (3,m) = A/'(3,m). 

The quantities A/"'(l,m 2 , . . . ,m„), A/"(1,to 2 , . . . , to„), AT' (2, m 2 ), and A/"(2,m 2 ) are much easier to 
to compute than A/"(3, 3): explicit formulae for them are stated in theorem || of section |2[ 

Remark 2. The value o/A/"(3, 3) was previously unknown and the authors are unaware of any earlier 
result implying the eguality Af'(3,m)=Af(3,m). In particular, the only other information previously 
known about Af' (3, to) or Af(3, to) was an upper bound of 3 m + 2 2( m + 2 )( m + 1 )/ 2 for A/"'(3, m) (see 
remark^ below). Form = 3, 4, 5 the latter formula evaluates to 248832, 23887872, and 4586471424 
respectively, o 

Remark 3. Note that Af (mi,... ,to„) < A/"(mi,... ,to„) /or aZ/ mi,... ,to„ G N, since non- 
degenerate roots of n x n fewnomial systems are always isolated roots. While we do not yet know of 
any cases where the inequality is strict, it is interesting to note that GKC can not be strengthened to 
allow degeneracies: For example, the polynomial system^ (xi(x3 — 1), £2(^3 — l),IIi=i( a; l — *) 2 + IliLife 
is of type (2,2,21), has 25 integral roots in the positive octant (all of which have singular Jacobian), 
but its GKC bound is 20. o 

Remark 4. Haas' counter-example to the original Kushnirenko's Conjecture is 

(x{ 08 + l.lxf - l.lx 2 ,xl 0% + l.lxf - l.lxx), 
which has 5 roots in the positive quadrant, thus contradicting its alleged GKC bound of 4 | HaaOO] j^] 



Jan Verschelde has also verified numerically that there are exactly 108 2 = 11664 complex roots, and 
thus (assuming the floating-point calculations were sufficiently good) each root is non- degenerate by 
Bezout's theorem, o 

The central observation that led to our proof may be of independent interest. We state it as 
assertion (3) of theorem || below. The first two assertions dramatically refine the bounds of Oleinik, 
Pctrov s ky, M i lnor, Thorn, an d Basu on the number of connected components of a real algebraic set 
[|OP4g| , |Mil64j , fTho65t|Bas9£ l in the special case of a single polynomial and extend to real exponents: 



Theorem 2. Let Z be the set of roots in K" of an n-variate m-nomial. Also let K'(n, /x) denote th 



e 



maximal number of non- degenerate roots in R™ of an nxn fewnomial system with exactly \jl distinct 
exponent vectors. Finally, let P C omp{n,m) (resp. P non (n,m)) be the maximal number of compact 
(resp. non-compact) connected components of any such Z. Then... 

1- Pcomp(n-,m) < 2|~/C'(rt, to)/2] , the multiple of 2 can be removed in the smooth case, and 
-Pcom P (l, m) = m - 1. 



Examples of this type wer e obse rved earlier by William Fulton around 1984 (see the first edition of [ ^u!98 ) and 



Bernd Sturmfels around 1997 ptu98 ], 

5 Dima Grigoriev informed the author on Sept. 8, 2000 that Konstantin A. Sevast'yanov, a colleague of Kushnirenko 
and contemporary of Grigoriev, had found a similar counter-example much earlier. Unfortunately, this counter- 
example does not seem to have been recorded and, tragically, Sevast'yanov committed suicide some time before 1997. 
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2. P n0 n(n,m)<2(P comp (n-l,m) + P nou (n-l,m)), P non {2, m) < fm/2] , and P uon (l,m) is 1 or 
according as m is or no£. 

3. (n, m) = (2,3) ==>• Z /ias no more i/ian 3 inflection points and no more than 1 isolated point 
of vertical tangency. 

Remark 5. Note that a non-compact component of Z can actually have compact closure, since R™ 
is not closed in W" 1 , e.g., {(x\,X2) 6R|. | x\ + x\ = 1}. Also, Bertrand Haas has pointed out that 
the bound on P non (2,m), at least in the case of integral exponents, may date back to work of Isaac 
Newton in the 1 r fi L century on power series, o 

While the above bounds on the number of connected components are non-explicit, they are stated 
so they can immediately incorporate any advance in computing JC'(n, m). So for a general and explicit 
bound independent of the underlying polynomial degrees now, one could, for instance, simply insert 
the explicit upper bound for K,'(n, fi) appearing in Khovanski's Theorem on Fewnomials (see section 
1.1 below). 

Corollary 1. Following the notation of theorem^, Z has no more than 2^ m ~ 1 ^ m ~ 2 ^ 2 2 n ~ 2 n(n + l) m_1 
connected components. In particular, a curve in the positive quadrant defined by a tetranomial has 
no more than 4 compact (resp. 2 non-compact) connected components. I 

1 )l 2 {2n) n - 1 (2n 2 - 
3.14, cor. 51 



The bound above is already significantly sharper than an earlier bound of 2 m ' m 
which held only for the smooth case, following from |Kho91 , 



-n + l) m 
The bounds of the- 
orem y are further refined in theorem |4| of section |^, and these additional bounds also improve an 
earlier result of the author on smooth algebraic hypersurfaces [RojOOa, cor. 3.1]. 



1.1. Important Related Results. 

It is interesting to note that the best current general bounds in the direction of GKC are exponential 
in the number of monomial terms of F, even for fixed n. Observe one of the masterpieces of real 
algebraic geometry. 



Khovanski's Theorem on Real Fewnomials (Special Case). (See also [Kho8C| and | Kho91 , 
cor. 7, sec. 3.12] .) Let F be an n x n fewnomial system and [i the total number of distinct exponent 
vectors of F. Then F has no more than (n + l)^2 Al ^ I_1 ^/ 2 non-degenerate roots in the positive 
orthant, i.e., K'(n, /i) < (n + i)m2Mm-i)/2_ m 

Remark 6. In the case (n, mi, m^) = (2, 3, m), one can divide both equations by suitable monomials 
to obtain fi = m + 2 and thus TV' (3, m) < /C'(3, m + 2). So Khovanski's bound implies A/"'(3, m) < 
gm+22(m+2)(m+i)/2^ jj- j s a ^ s0 ver y taS y t see that a simple application of Gaussian elimination 
yields K,'(n, m) <Af'(m — 1, . . . ,m — 1). 



Non-trivial lower bounds on even A/ 7 (3, m) are scarce and surprisingly little else is known about 
what an optimal version of Khovanski's Theorem on Fewnomials should resemble. For example, an 
earlier (conjectural) polyhedral generalization of Descartes' Rule to multivariate systems of equations 
proposed by Itenberg and Roy in 1996 [ [R96| (based on a famous cons truction of Oleg Viro from 
1989 and extensions by Bernd Sturmfels ]Stu94 1 ) was recently disproved |LW98 |. Also, a bit earlier, 
Bernd Sturmfels bet (and unfortu nately lost) US$500 on a challenge problem involving a family of 
polynomial systems of type (4,4) |LR97| . 

To the best of the authors' knowledge, all other general bounds on the number of real roots 
depend strongly on the individual exponents of F and are actually geared more toward counting 
complex roots, e.g., [BKK76, Kaz81 , BLR91 , Roj99 , RojOOa . So even proving A/"(3, 3) < oo already 
requires a different approach. Nevertheless, the aforementioned bounds can be quite practical when 
the exponents are integral and the degrees of the polynomials are small. 

In any event, it still remains unknown whether K,'(n,m) is polynomial in m for n fixed. (The 
polynomial system (x\ — 3xi +2, ... , x\ — 3x n + 2) shows us that fixing n is necessary.) Even the 
case of a trinomial and an m-nomial, in two variables, remains open. More to the point, it is also 
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unknown whether a simple modification (e.g., increasing the original GKC bound by a constant power 
or a factor exponential in n) changes the status of GKC from false to true. The 2k x 2k fewnomial 
system (x} 08 + l.lyf - l.l yi ,yl° 8 + l.Lrf 4 - l.lsfi, . . . , 4° 8 + l.lyf - l.ly k ,yf s + l.lxf - l.lx k ), 
thanks to Haas' counter-example (cf. remark ||), easily shows that the GKC bound now needs at 

least an extra multiple no smaller than (^f^j if it is to be salvaged. 

Remark 7. Domenico Napoletani has recently shown that to calculate J\f '(mi, . . . , m n ) for any 



given (mi, . . . ,m n ), it suffices to restrict to the case of integral exponents [NapOl]. Here, we will 
bound jV(3, m) directly without using this reduction, o 

Let us also make a related number-theoretic observation: Hendrik W. Lenstra has shown that for 
any fixed number field £, the maximal (finite) number of roots in £ of a univariate m-nomial, with 
integral exponents and coefficients in C, is quasi-quadratic in m and independent of the degree of 
the polynomial |Len99 |. Thus an immediately corollary of theorem ^ (and theorem |3| and remark 



of section |2j) is that Lenstra's result can be effectively extended to certain families of fewnomial 
systems, provided we fix n and restrict to real algebraic number fields. (Fixing n is necessary for 
the same reason as in the last paragraph.) 

Whether Lenstra's result can be more fully extended to polynomial systems is also an open 
question, even in the case of two bivariate trinomials. However, it is at least now known that that 
the number of geometrically^ isolated roots in C n of any k x n polynomial system can be bounded 
above by some function depending only on C, n, and the total number of distinct exponent vectors 



1.2. Organization of the Proofs. 

Section || provides some background and unites some simple cases where GKC in fact holds. We 
then prove theorems ^ and || in sections || and respectively. Proving the upper bound on 7V(3, m) 
turns out to be surprisingly elementary, but lowering the bound on 7V(3, 3) to 5 then becomes a 
more involved case by case analysis. 

Section |^ then gives an alternative geometric proof that jV(3, 3) < 6. We include this second proof 
for motivational purposes, since it was essentially the first improvement we found over ^'(3,3) < 
248832. We then derive bounds for the number of isolated singularities and inflection points of an 
m-nomial curve, and discuss how the underlying Newton polygons (cf. the next section) strongly 
control how A/"(3,3) can exceed 4 (cf. corollary |^ of section [|). Roughly speaking, we show that 
if a fewnomial system of type (3,3) has maximally many roots in the positive quadrant, then its 
underlying exponent vectors must be in "general position." In particular, just like Haas' counter- 
example, the underlying Newton polygons of any counter-example to this case of GKC must have 
Minkowski sum a hexagon (cf . sections |^ and [|) . 

2. The Pyramidal, Simplicial, and Zero Mixed Volume Cases 

Consider the following constructions. 

Definition 2. For any SCR™, let Conv(S') denote the smallest convex set containing S. Also, for 
any m-nomial of the form f := J2 a eA c aX a , we call Supp(/) := {a | c a ^ 0} the support of f, and 
define Newt(/) := Conv(Supp(/)) to be the Newton polytope of f. More generally, a polytope 
is simply the convex hull of any finite point set in M n . I 

Definition 3. Let F '= (/i, . . . , /„) be a fewnomial system and for all i let Li be the linear subspace 
affinely generated bySupp(fi). We call F pyramidal iff the following condition holds for alii: either 
LiDLj for all j j^i, or there is a j such that Li®Lj=Li®L for some line L% Li withO^L. Finally, 



6 A root is geometrically isolated iff it is a zero-dimensional component of the underlying zero set in C n , where C 
is the algebraic closure of C. 



7 In fact, as was done more explicitly in |Lcn9E] for the univariate case, one can also allow C to be any finite 
extension of the p-adic rationals. The latter setting is perhaps closer to our current focus since R, like the p-adics, is 
a metrically complete field. 
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we call any change of variables of the form (xi, . . . , x n ) i— > y A :— (y^ 11 • • • y^" 1 , • • ■ , J/i 1 ™ • ■ ■ Vn nn )> 
with A:=[aij] a real nx n matrix, a monomial change of variables. ■ 

For example, the polynomial systems from remark [l] are all pyramidal, but the systems from 
remarks [| and ^ are not pyramidal (cf. section 1). Pyramidal systems are a simple generalization of 
the so-called "triangular" systems popular in Grobner-basis papers on computer algebra. The latter 
family of systems simply consists of those F for which the equations and variables can be reordered 
so that for all i, fi depends only on Xi,... ,X{. Put another way, pyramidal systems are simply 
the image of a triangular system (with real exponents allowed) after multiplying the individual 
equations by arbitrary monomials, shuffling the equations, and then performing a monomial change 
of variables. In particular, we note the following elementary fact on monomial changes of variables. 

Proposition 1. If A is a real non-singular nxn matrix, then (x A ) A =x and the map x i— > x A is an 
analytic automorphism of the positive orthant. In particular, such a map preserves smooth points, 
singular points, and the number of compact and non-compact connected components, of analytic 
subvarieties of the positive orthant. Furthmore, this invariance also holds for fewnomial zero sets in 
the positive orthant. ■ 

The assertion on analytic subvarieties follows easily from an application of the chain rule from 
calculus, and noting that such monomial maps are also diffeomorphisms. That the same in- 
variance holds for fewnomial zero sets follows immediately upon observing that the substition 
(xi, ... , x n = (e Zl , . . . , e Zn ) maps any n-variate real m-nomial to a real analytic function, and noting 
that (ti, . . . , ijij i — ► (e* 1 , ... , e*") is a diffeomorphism from R n to R™ . 

Remark 8. The zero set of x% + X2 — 1, and the change of variables (xi,X2) i— > {^,yiyi), show 
that the number of isolated inflection points need not be preserved by such a map: the underlying 
curve goes from having no isolated inflection points to having one in the positive quadrant, o 

We will also need the following analogous geometric extension of the concept of an over-determined 
system. 

Definition 4. Given polytopes Pi,... , P n C R", we say that they have mixed volume^ zero iff 
for some d G {0, . . . , n — 1} there exists a d-dimensional subspace of 1" containing translates of Pi 
for at least d + 1 distinct i . ■ 

A simple special case of an n-tuple of polytopes with mixed volume zero is the n-tuple of Newton 
polytopes of an n x n fewnomial system where, say, the variable Xi does not appear. Indeed, by 
multiplying the individual m- normals by suitable monomials, and applying a suitable monomial 
change of variables, the following corollary of proposition [l] is immediate. 

Corollary 2. Suppose F is a fewnomial system, with only finitely many roots in the positive orthant, 
whose n-tuple of Newton polytopes has mixed volume zero. Then F has no roots in the positive 
orthant. ■ 

Indeed, modulo a suitable monomial change of variables, one need only observe that the existence of 
a single root in the positive orthant implies the existence of an entire ray of roots (parallel to some 
coordinate axis) in the positive orthant. 

We will also need the following elegant extension of Descartes' Rule to real exponents. It's proof 
involves a very simple induction using Rolle's Theorem (cf. the next section) and dividing by suitable 



monomials [Kho91| — tricks we will build upon in the next section. 

Definition 5. For any sequence {c\, . . . ,c m ) £ M. m , it's number of sign alternations is the 
number of pairs {j,j'} £ {1, • • • , m} such that j <j, CjCji <0, and Cj = when j <i<j' . ■ 

Univariate Generalized Descartes' Rule of Signs (UGDRS). Let ci,a\,... ,c m ,a m be any 
real numbers with a\ < ■ ■ ■ < a m . Then the number of positive roots of 2<=i c i x i' * s a t nnost the 
number of sign alternations in the sequence (ci, .. . ,c m ). In particular, M' (m)=Af(m) — m — I. ■ 



8 The reader curious about mixed volumes of polytopes in this context can consult JBZ8^ , [Roj99| for further 
discussion. 
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As a warm-up, we can now prove a stronger version of GKC for the following families of special 
cases. 

Theorem 3. Suppose F is an n x n fewnomial system of type (mi, . . . , m n ) (so mi, . . . , m n >l) 
and we restrict to those F which also satisfy one of the following conditions: 

(a) The n-tuple of Newton polytopes of F has mixed volume zero. 

(b) All the supports of F can be translated into a single set of cardinality <n + 1. 

(c) F is pyramidal. 

Then, following the notation of theorem 0. . . 

0. W(mi, . . . ,m n ) is respectively 0, 1, or n™=i( m * — 1) * n case ( a )' W> or (°)- 

1. In cases (a), (b), and (c), F has infinitely many roots =>■ F has no isolated roots. 

2. In general, M'(l,m 2 , ■ ■ ■ ,m n )=M(l,m 2 , ■ ■ ■ ,m n )=0, M'(2,m 2 , ■ ■ ■ , m n ) = M'(m 2 , ■ ■ ■ ,m„), 
and Af(2, m 2 , . . . , m„) =M(m 2 , . . . , m„). 

3. J\f'(2,m)=J\f(2,m)=m- 1. 

Proof: First note that the Newton polytopes must all be nonempty. The case (a) portion of 
assertions (0) and (1) then follows immediately from corollary|2[ Note also that the case (a) portion 
of assertion (0) immediately implies our formula for Af(l, m 2 , . . . , m n ) (and thus Af'(l, m 2 , . . . , m n ) 
as well) in assertion (2), since the underlying n-tuple of polytopes clearly has mixed volume zero. 

The case (b) portion of assertions (0) and (1) follows easily upon observing that F is a linear 
system of n equations in n monomial terms, after multiplying the individual equations by suitable 
monomial terms. We can then finish by proposition |l| 

To prove the case (c) portion of assertions (0) and (1), note that the case n— \ follows immediately 
from UGDRS. For n > 1, we have the following simple proof by induction: Assuming GKC holds 
for all (n — 1) x (n — 1) pyramidal systems, consider any n x n pyramidal system F. Then, via a 
suitable monomial change of variables, multiplying the individual equations by suitable monomials, 
and possibly reordering the fi, we can assume that f\ depends only on x\. (Otherwise, F wouldn't 
be pyramidal.) We thus obtain by UGDRS that f\ has at most mi — 1 positive roots. By back- 
substituting these roots into F' :— (f 2 , . . . ,/„), we obtain a new (n' — 1) x (n' — 1) pyramidal 
fewnomial system of type [m! 2 , . . . , m' n ,) with n' <n and m' 2 < m 2 , . . . , m! n < m n . By our induction 

hypothesis, we obtain that each such specialized F' has at most n™=2( m i — 1) isolated roots in 
the positive orthant, and thus F has at most Ili=i( TO i — 1) isolated roots in the positive orthant. 
(Remark [l] from the introduction shows us that this bound can indeed be attained.) 

Our recursive formulae for Af'(2, m 2l . . . ,m n ) and Af(2,m 2 , . . . ,m n ) from assertion (2) then 
follow by applying just the first step of the preceding induction argument, and noting that proposition 
|l| tells us that our change of variables preserves non-degenerate roots. 

Assertion (3) follows immediately from assertion (2) via UGDRS. ■ 

Remark 9. One can of course combine and interweave families (a), (b), and (c) to obtain less 
trivial examples where GKC is true. More generally, one can combine theorems [J and |^ to obtain 
bounds significantly sharper than Khovanski's Theorem on Real Fewnomials, free from Jacobian 
assumptions, for additional families of fewnomial systems, o 

3. Substitutions and Calculus: Proving Theorem |l] 

Let us preface our first main proof with some useful basic results. 

Lemma 1. For mi = 1 + dim Newt the computation of M'imi, . . . ,m n ) andAf(mi,... ,m n ) 
can be reduced to the case where fi := 1 ± Xi ± • • • ± x mi -i (with the signs in f\ not all "+") and, 
for all i, fi has 1 as one of its monomial terms. In particular, for mi —3, we can assume further 
that fi := 1 — xi — x 2 . 

Proof: By dividing each m^-normal by a suitable monomial term, we can immediately assume that 
all the fi possess the monomial term 1. In particular, we can also assume that the origin O is a 
vertex of Newt(/i). Note also that the sign condition on / a must obviously hold, for otherwise the 
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value of fi would be positive on the positive orthant. (The refinement for m — 3 then follows by 
picking the monomial term one divides f\ by a bit more carefully.) So we now need only check that 
the desired canonical form for f\ can be attained. 

Suppose /i := l+c\x ai +• • ■+c mi _ii£ am i~ 1 . By assumption, Newt(/i) is an mi-simplex with vertex 
set {O, <zi, . . . , a mi _i}, so oi, . . . , a mi _i are linearly independent. Now pick any a mi , ... , a n 6K n 
so that oi, . . . ,a n are linearly independent. The substitution x >— > x A (with A the matrix whose 
columns are a\, . . . , a n ) then clearly sends f\ i— ► 1 + cixi + • • • + c mi _ia; mi _i, and proposition [j] 
tells us that this change of variables preserves degenerate and non-degenerate roots in the positive 
orthant. Then, via the change of variables {x\,... ,x mi -i) i— > (xi/|ci|, . . . , x mi _i/|c mi _i|), we 
obtain that /i can indeed be placed in the desired form. (The latter change of variables preserves 
degenerate and non-degenerate roots in the positive orthant for even more obvious reasons.) I 

Recall that a polynomial p£C[xi, .. . , x n ] is homogeneous iff p(ax\, .. . ,ax n ) = a d p(xi,... ,x n ) 
for some non-negative integer d. 

Proposition 2. Suppose p S R[:z;i,i£2] is homogeneous and has degree d>0. Also let a, ft G R. 
Then there is a homogeneous q € M.[xi,X2], either identically zero or of degree d + 1, such that 
f t (t a (l - t)Pp(t, 1 - t))=t a - 1 (l - tY^qit, 1-t). In particular, f t (t a {l - t) p(t, 1 - *)) is idm- 
tically zero iff p(xi,X2) = Xx ^x% , cv — j3 = d, and a, (3£Z with a>0. ■ 

Proof: By the chain-rule, 4i (t a (l — t)@p(t, 1 — f)) is simply 

cd~-\l - tfp(t, 1 - t) + 0t a {l - tf^pit, 1 - t) + t a (l - tf(pi{t, 1-t)- p 2 (t, 1 - t)), 

where ft denotes the partial derivative of p with respect to Xi. Factoring out a multiple of t Q_1 (l — t)°~ 
from the preceding expression, we then easily obtain that we can in fact take 

q(x 1 ,x 2 ) = (ax 2 + /3x 1 )p(x 1 ,x 2 ) + Xix 2 (pi(xi, x 2 ) - p 2 (xi,x 2 )). 
The final assertion of our proposition then follows immediately. ■ 

Rolle's Theorem. |^| Let g : [a, b] — ► R be any function with a well-defined derivative g' defined 
on (a, b). Then g has r roots in [a, b] g' has at least r — 1 roots in (a, b). I 



Lemma 2. Let k>2. Then for any real ci,ai,6i, .. . ,Ck,ak,bk, the function 

f(t) := 1 + ci< ai (1 - i) bl + • • • + c k t a " (1 - t) bk 

has at most 2 k+1 — 2 roots in the open interval (0, 1). Furthermore, f has exactly r roots in (0, 1) ==>■ 
there exist c\ , . . . , Cfe € R such that 

f{t) := 1 + ~ Cl t ai (1 - <) bl + • ■ ■ + 4i afc (1 - t) bk 

has at least r roots in (0, 1), and no root of f is degenerate. 

Proof: Henceforth, let us assume all roots lie in the open interval (0, 1). Assume / has exactly r 
roots. Then by Rolle's Theorem, /' has at least r — 1 roots. Since 

k 

fit) = ^ ai_1 (i - t) b >-\ ai (i -t) + ^t), 

i=l 

and since t ak ~ 1 (l — t) bk ^ 1 never vanishes in (0, 1), the function 

fc-i 

5 i (t) :=c k {a k {l -t) + b k t) + ^- ak (1 - t) bi - b *( ai (l - t) + b t t) 

i=l 

has at least i — 1 roots. 



9 For a simple p roof, note that the special case r = 2 follows immediately from the Mean Value Theorem of calculus 
(see, e.g., [Ftud76, thm. 5.10, pg. 107]), since we can replace [a,b] by a sub-interval whose end-points are the roots of 
g. The general case then follows by replacing a (resp. b) by the smallest (resp. largest) root of g, and then subdividing 
[a, b] into r — 1 sub-intervals whose endpoints consist of the roots of g. 
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By Rolle's Theorem again, g'[ has at least r — 3 roots. By proposition ||, g'{(t) will then be 
of the form Yli=i Cit ai ~ ak ~ 2 (l — t) bi ~ bi ~ 2 q iy i(t, 1 — t), where the q^i € K[xi,X2] are homogeneous 
polynomials, which are either identically zero or of degree 3. In particular, we can assume that at 
least one q^i must be different from the zero polynomial. (For otherwise we would obtain that g'{ = 
identically, which would in turn imply that g\ is a linear function, and thus r < 3 < 2 k+2 — 2.) By 
again dividing by a suitable monomial in t and 1 — t, we then see that g'{ has the same number of 
roots as 

k-2 

92(t) := ft-i,i(t, 1 - t) + £ Cit*-^-^ - i) 64 - 6 "- 1 *,!^, 1 - <)• 

i=l 

Thus <?2 has at least r — 3 roots. 

By induction, we then easily obtain a sequence of polynomials g\ , . . . , gj , where j < A; and = 
qi.j-i (t, 1 — t) for some homogeneous geR[xi, £2] of degree 2 J — 1 having at least r — (2 J — 1) roots. 
So by Rolle's Theorem one last time, r < (2 J — 1) + (2 J — 1) < 2 fe+1 — 2 and we are done with the 
first part of our lemma. 

To prove the second part, note that the first part of our lemma implies that / has only finitely 
many critical values — no more than 2 fc+1 — 2, in fact. So for all <5€R* with \6\ sufficiently small, 
/ — S will have no degenerate roots. We can in fact guarantee that / — 8 will also have at least 
r non-degenerate roots in (0, 1) as follows: Let n + (resp. n_) be the number of roots t of / with 
/'(t) = 0and f"(t)>0 (resp. f"(t)<0). Clearly then, for alltfeM* with \S\ sufficiently small, / - 5 
will have exactly r + n_ — 71+ or r + n + — n~ roots, according as 8 > or 5 < 0. (The analogous 
statement for roots in (0, 1) holds as well, since (0, 1) is open.) So let 8 be sufficiently small, and of 
the correct sign, so that / — S has at least r roots in (0, 1) and no degenerate roots. 

To conclude, simply let ^i :=, j £ S f° r a U Since / is thus y~|j we are done. I 

Proof of Theorem [I]: First note that by lemma [|, we can immediately reduce to the case of a 
fewnomial system of the form F := (1 — x\ — X2, 1 + cix^x^ 1 + ■ • • + Cm-ix"" 1-1 :^'" -1 ), an d this 
reduction preserves the degeneracy or non-degeneracy of any root of F. We can then simply solve for 
X2 via the first equation and then substitute into the second equation to obtain a bijection between 
the roots of F in the positive quadrant and the roots of f(t) := l+cit ai (1 — t) bl + ■ ■ ■+c ro _ii 0m_1 (1 — 
£) fc ™-! with < t < 1. A simple Jacobian calculation yields that (Ci: C2) is a degenerate root of 
F <=► E™T 1 c I Cr^ 1 (l-Ci) bl - 1 (a I (l-Ci)-^Ci) = and /(Ci)=Q] /'(Ci) = /(Ci)=0. So 
degenerate roots of our univariate reduction correspond bijectively to degenerate roots of F. 

By lemma |^, and the fact that A/"' (3, m) <7V(3, to), we immediately obtain A/"(3, to) < 2 m — 2 and 
A/"'(3,to) =A/"(3, m). Our upper bounds on J\f(3,4) and Af(3,5) are then simply specializations of 
our new upper bound for A/"(3, to). 

To now prove that A/"(3, 3) = 5, thanks to Haas' counter-example from remark^, it suffices to show 
that Af(3, 3) < 6. To do this, let us specialize our preceding notation to to = 3, (ci, c 2 ) = (—A, — fl), 
and (czi, 61, 02, 62) = (a, c, c?), for some a, 6, c, K and positive ^4 and £?. (Restricting to positive 
A and £> can easily be done simply by dividing /2 by a suitable monomial term, a la the proof of 
lemma |l|.) 

By using symmetry we can then clearly reduce to the following cases: 
A. a, b, c> and d < B. a, c > and b, d < 

C. a, b > and c, c? < D. a, 6, c, rf > 

E. a > and b, c, d < F. a,b,c,d <0 

G. a, d > 0, 6, c < H. At least one of the numbers a, 6, c, d is zero. 

In particular, our earlier substitution trick tells us that it suffices to show that any 

/(*) := 1 - At a {l - t) b - Bt c (l - t) d , 

with all roots non-degenerate, always has strictly less than 6 roots in the open interval (0,1). 
So let r be the number of roots of any such non-degenerate / in (0, 1). 
Let us now prove r<6 in all 8 cases: 
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A. a, b, c> 0, d < 0: 

Let Q(x) — 1 — Ax a (l — x) b and R{x) = Bx c (\ — x) d . The roots of / may be regarded as 
the intersections of y = Q(x) and y = R(x) in the positive quadrant. Since lim x ^ Q + Q(x) = 1, 
lim^^i- Q(x) — 1, lim x _^o+ R{%) — 0, and lim x _>i- R(x) — oo, it is easy to see via the 
Intermediate Value Theorem of calculus that the number of intersections must be odd. (One 
need only note that / = Q — R and that the signs of /' at the ordered roots of / alternate.) 
So r<6. 

B. a, c > 0, b, d < 0: 

By an argument similar to that of case A, r is odd and thus less than 6. 

C. a, b > 0, c, d < 0: 
See lemma [| below. 

D. a, b,c,d> 0: 

See lemma |] below. 

E. a > 0, b, c, d < 0: 
See lemma [| below. 

F. a, b,c,d < 0. 

Multiplying f(t) by £ max { - °> -c }(l - ^max{-b,-d} ^ we can i mmec ji a tely reduce to case D. 

G. a, d > 0, b, c < 0: 
See lemma below. 

H. At least one of the numbers a, b, c, d is zero: 

Use lemma ^ below, noting that our hypotheses here imply that either F or F is a quadratic 
polynomial. 
This concludes the proof of theorem |]. ■ 
We now detail the lemmata we cited above. 

Lemma 3. Following the notation of the proof of theorem ^, recall that r is the number of roots of 
f(t):=l — At a (l—t) b — Bt c (l — t) d in the open interval (0,1), where f has no degenerate roots. Also 
let g(t) := ^t a - c (l - t) b - d (-a(l - t) + bt) - c(l - t) + dt, 

F{u) := -a{a - c)(o - c - l)u 3 + (o - c)[2o(6 - d + 1) + b(a - c+ l)]u 2 + (d - b)[a(b -d+l) + 2b{a - c + l)]u + b(b-d){b-d-l), 
and 

F(u) := -c(c - o)(c - o - l)ii 3 + (c - a)[2c{d - b + 1) + <J(c- a + l)]u 2 + (6 - #(<2 - 6 + 1) + 2d(c - a + l)]u + d(d - 6)(d - 6 - 1). 
Finally, let N (resp. M) be the number of roots in (0,1) o/g (resp. the maximum of the number of 
positive roots of F and F ). Then r-3<N - 2<M<3. 

Proof: Just as in the proof of lemma ||, we easily see by Rolle's Theorem and division by suitable 
monomials in t and 1— t that r— 1 is no more than the number of roots in (0, 1) of g. So r— 1 < N. Note 
also that, in a similar way, r — 1 is no more than the number of roots of g(t) := ^t c ~ a {\ — t) d ~ h g{t) 
in (0, 1), and the latter function has the same number of roots in (0, 1) as g. 

To conclude, simply note that for suitable a, /3, 7, 5 G M, we have that F(^-^) =t a (l — t)Pg"(t) 
and F(±f±) = f(l ~ t) s g"(t). So, by our preceding trick again, N - 2 < M, and thus r - 3 < M. 
That M < 3 is clear from the fundamental theorem of algebra. ■ 

Lemma 4. Following the notation of lemma ^, let 

T( x ) := ^x a - c {l - x) b - d (-a(l - x) + bx), S(x) := c - (c+ <Qa;, 

f(x) := ^x c - a {l - x) d - b (-c(l -x) + dx), S(x) =a-(a + b)x. 
Then [a,b > and c,d<0] r < 6. 

Proof: By lemma [| we are done if M < 3 or N < 5. So let us assume M = 3 to derive a 
contradiction. By Descartes' Rule of Signs (see section [2] for a generalization), the coefficients of 
F(u) or F(u) (ordered by exponent) must have alternating signs. Thus, since a, a — c,b,b — d> 0, 
we have that a — c — 1 and b — d — 1 must have the same sign. We then need to discuss two cases: 
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• a — c — 1 < and b — d — 1 < 0: 

This implies c— a+ 1 > and d— b+ 1 > 0. Consequently, coefficients of u 3 and u 2 in F(u) 
and -F(u) are all positive — a contradiction. 

• a — c — 1 > and & — d — 1 > 0: 

The roots of g in (0,1) can be regarded as intersections of y = T{x) and y — S(x), for 
< x < 1. Since r(ar) < for < x < 1 and T(x) > for < 1 - x -C 1, there is a 
smallest positive local minimum cq of T with T(co) < Q Thus for x near Co, T"(x) > 0. Since 
T"(a;) < for < x < 1, there is c* G (0, c ) such that T"(c*) = 0. Let (xi,yi), . . . , {x K ,y K ) 
be the intersection points of y = T{x) and y — S(x) with x\ < X2 < ■ ■ ■ < Xk, where a 
tangent point is counted twice. Then for all i £ {1, ... , K — 1} there is a € (a;,, Xj+x) with 
T'(a) = -(c + d) > 0, and for all is {1, . . . , K - 2} there is a d; S (c 4 ,c 4+1 ) with T"{ck)=0. 
Note that Co < Cx- Thus c* < dx and therefore T"(x) = has at least K — 1 solutions. Since 
T" and -F have the same number of positive roots (observing that T"(u)/F(u) is a monomial 
in u and 1 — u), we have N — 1 < K — 1 < 3. ■ 

Lemma 5. Following the notation of lemma 0, a, 6, c, d > ==>■ r < 6. 

Proof: Again, by lemma ||, we need only show that M<3 or N<5. So let us assume M = 3. Then 
by Descartes' Rule of Signs, (a — c) (a — c — 1) and (6 — d)(6 — d — 1) in the coefficients of u 3 and u° 
in F(u) must have the same sign. There are now four cases to be examined. 

• The signs of a — c, a — c — 1, b — d, and b — d — 1 are respectively +, — , +, and — : 

This makes the signs of coefficients of u 3 and u 2 of F(u) both positive. 

• The signs of a — c, a — c — 1, b — d, and b — d — 1 are respectively — , — , +, and +: 

Since b — d > 0, we have d — b < and d — b — 1 < 0. This makes the constant term of 
F(u) positive, and hence, the coefficients of u and u 2 of F(u) must respectively be negative 
and positive. That is, c(d- b+ 1) + 2d(c - a + 1) < and 2c(d - b + 1) + d(c- a + 1) > 0. 
Thus, — c(d — 6 + 1) + d(c — a + 1) < 0. This is false, since 6 - d - 1 > and a - c - 1 < 0. 

• The signs of a — c, a — c — 1, b — d, and 6 — d — 1 are all negative: 

By Descartes' rule of signs, d — b — 1 and c — a — 1 in the coefficients of y 3 and y° of -F(y) 
must have the same sign. If both are negative, then coefficients of u 3 and u 2 of F(u) would 
both be negative. Thus d — b — 1 > and c — a — 1 > 0. It is easy to see that T{x) < for 
< x < 1 and T(x) > for < 1 - x < 1 and lim 2 ,^ + T(x) = lim^i- T(x) = 0. Now let 
Lq = min{c | 1 > c > 0, T(c) < and c is a local minimum} and Uq = max{c | 1 > 
c > Lqj T'(c) is a local maximum}. Then for x near Lq, T"(x) > 0. Since T"(x) < for 

< x < 1, there exists < L a < L such that f"(Li) = 0. Similarly, there is a f/i S (t/ 0) 1) 
such that f"(f/x) = 0. 

The roots of ^t c ~ a (l — t) d ~ b g can be regarded as the intersections of y = T(x) and y = S(x), 
for < x < 1. Let (xx, j/i), • • ■ , {xk,Vk) be the intersection points with x\ < X2 < ■ ■ ■ < Xfe, 
where a tangent point is counted twice. Then there exist Xi < Cj < x^+x such that T'(ci) = 
-(a + 6) < 0, i = 1, . . . , k - 1 and Cj < d t < c i+ i such that f"(d t ) = 0, i = 1, . . . , k - 2. 
If xx > Loj then Lx < dx. If x\ < L 0l then T(xi) < 0. This implies T(x l ) < for all 

1 = 1, . . . , k — 2, since the slope — (a + 6) of S'(x) is negative. Therefore, Xk-2 < Uo and hence 
dfc-2 < C/i- So T"{x) = has at least ft — 1 solutions. Since T"{x) — and = have the 
same number of solutions, we have N — l<fc — l<Af = 3. 

• The signs of a — c, a — c — 1, 6 — d, and — d — 1 are all positive: 

Since a — c — 1 > and — d — 1 > 0, the proof follows the same line of arguments as the 
last case by considering the intersections of T(x) and S(x) instead. 

■ 

Lemma 6. Following the notation of lemma [a > and 6, c, d < 0/ r < 6. 

Proof: Once again, by lemma |, it suffices to show that M<3 or N<5. So let us assume that 
M = 3. By checking coefficients of u 3 and u° in F(u), Descartes' Rule of Signs tells us that a — c— 1 
and (b — d) (b — d — 1) must have different signs. There are now three cases to be examined. 
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• a — c — 1, b — d, and 6 — d — 1 are all negative. 

Then the signs of the coefficients of both u 3 and u 2 in F(u) will all be positive. 

• The signs of a — c — 1, b — d, and b — d — 1 are respectively — , +, and +. 

Multiplying / by x~ c (l - x)~ d yields u(x) := x~ c (l - x)- d ~ Ax a - C {l - xf- d - B, where 
— c > 0, a — c > 0, — d > 1, and — d + b > 1. The roots of w in (0, 1) can be regarded as 
the intersections of the curves y = v(x) = x~ c (l — x)~ d — Ax a ~ c (l — x) b ~ d and y = B. Let 
(xi, yi), . . . , (x n , y n ) be the intersection points of y — v(x) and y = B with x\ < X2 < ■ ■ ■ < x n , 
where a tangent point is counted twice. Then there exist x% < Ci < such that v'(ci) = 0, 
i = 1, . . . , n — 1 = N, Thus v' has at least N roots in (0, 1). A straightforward computation 
then yields, 

v'(x) = Ax a - C ~ x {l - xf-^i-ia - c)(l - x) + (b - d)x) + x~ c - l {l - x)- d - 1 (-c(l - x) + dx), 
which clearly has the same number of roots in (0, 1) as 

t{x) := Ax a (l ~ x) b (-(a -c)(l-x) + (b- d)x) - c(l - x) + dx. 
Thus t" has at least N — 2 roots in (0, 1). Since 

t"(x)/A = x a - 2 (l - x) b - 2 [-(a - c)a(a - 1)(1 - x) 3 + a((a + 1)(6 - d) + 2(6 + l)(a - c))x(l - xf 
-b((b + l)(a - c) + 2(6 - d)(a + l))x 2 {l - x) + (6 - d)6(6 - l)x 3 }, 

t" has as many roots in (0, 1) as 

P(u) = -(a - c)a(a - l)u 3 + a((a + 1)(6 - d) + 2(6 + l)(o - c))u 2 
-6((6 + l)(a - c) + 2(6 - d)(a + l))u + (6 - rf)6(6 - 1) 

has positive roots. Since a — 1 < a — c — 1<0, the coefficients of u 3 and u° in P(w) are both 
positive. Thus P has at most 2 positive roots and we obtain N — 2 < 2. 

• The signs of a — c — 1, b — d, and 6 — d — 1 are respectively +, +, and — : 

Since a — c — 1 > and 6 — d > 0, it is easy to see that T(x) < for 0<i<1 and 
lim x ^ 1 - T(x) = — oo. If T(x) has no local minimum, then y = T(x) and y = 5(x) have at most 
one intersection point. Otherwise, let Co = min{c | l>c>0, c is a local minimum of T}. 
The rest of the proof is similar to that of lemma ^. ■ 

Lemma 7. Following the notation of lemma ^ [a, d > and 6, c < 0/ r < 6. 

Proof: One last time, lemma || tells us that it suffices to prove that M < 3 or N < 5. So let's assume 
that M — 3. Checking signs of coefficients of u 3 and u° of both F(u) and F(u), Descartes' Rule of 
Signs tells us that a — c — 1 < and d — b — 1 < 0. On the other hand, the alternating signs of 
coefficients of u 2 and u 1 of F(u) yield 

2a(6- d+ 1) +b(a- c + 1) < and a(6 - d + 1) + 26(a - c + 1) > 0. 

Thus, 

-a(b - d + 1) + b(a - c+ 1) = a(d - 1) + 6(1 - c) > 0. 
This is impossible, since d — 1 < d — 1 — 6<0, 1 — c > 0, a > 0, and 6 < 0. ■ 
Remark 10. When A = 1.12, B = 0.71, a = 0.5, 6 = 0.02, c = -0.05, and d = 1.8, 
(1) /(x) = 1 - Ac Q (l - x) b - Bx c (l - x) d = 0, < x < 1 



has 5 solutions. They are, approximately, {0.00396494,0.02986317,0.4354707,0.72522344,0.99620026}. o 
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4. A Simple Geometric Approach 
Let us begin with an extension of Rollc's Theorem to smooth curves in the plane. 

Lemma 8. Suppose CcR 2 is an arc (i.e., image of an interval or circle under a continuous map) 
with 

1. A unique well-defined tangent line for each xGC. 

2. At most I isolated^ inflection points. 

3. At most V isolated points of vertical tangency. 

Then the maximum finite number of intersections of any line with C is I + V + 2. 

Proof: Let S 1 be the realization of the circle obtained by identifying and it in the closed interval 
[0, 7r]. Consider the natural map tf> : C — ► S 1 obtained by x i— * 9 X where 9 X is the angle the normal 
line of x forms with the xi-axis. We claim that any 9GS 1 has at most I + V + 1 pre-images under 

4>. 

To see why, note that by assumption we can express C as the union of no more than I + V + 1 
arcs where (a) any distinct pair of arcs is either disjoint or meets at < 2 end-points, and (b) every 
end-point is either an isolated point of inflection or vertical tangency of C. Calling these arcs basic 
arcs, it is then clear that the interior of any basic arc is homeomorphic (via <fr) to a connected subset 
of S' 1 \{0}. Furthermore, by construction, the cardinality of _1 (O) is exactly V. So we indeed obtain 
that any 9&S 1 has at most I + V + 1 pre-images under 4>. 

Now note that any line {x \ m\X\ + 7122X2 = mo} normal to C forms an acute angle of ArcTan(^-) 
with the Si-axis. Thus, the number of contact points C has with the differential system 

dx\ 8x2 

-dT = m2 ' -dT = - mi 



is^| at most I + V + 1. By Rolle's Theorem for Dynamical Systems in the Plane (see, e.g., [Kho91 



corollary, pg. 23]), we then obtain that the number of intersections of {x \ m\X\ + 1112X2 = mo} with 
C is at most I + V + 2, for any real (mo, mi, 1712)- So we are done. I 

Remark 11. The bound from lemma^ is tight in all cases. This is easily revealed by the following 
examples and their obvious extensions: 




Figure 1. Lemma || gives a tight bound for (I,V) e 
{(0, 0), (3, 1), (4, 1), (3, 2), (7, 5)} and this generalizes easily to arbitrary (7, V). 

The authors do not presently know whether this bound remains tight when restricted to fewnomial 
zero sets, o 



10 Relative to the locus of inflection points. 

11 i.e., the number of points at which some solution of the differential system has a tangent line in common with 
C is... 
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We are now ready to give a quick geometrically motivated proof of the nearly optimal bound 
■A/"(3, 3) < 6. This "second" proof of 7V"(3, 3) < 6 was actually the original motivation behind this 
paper. 

Short Geometric Proof of -A/"(3, 3)<6: Theorem || implies that we can assume that fi and f 2 
have Newton polygons that are each triangles. Letting Z denote the zero set of f% in lemma 
|l| of the last section tells us that we can assume that f\ = 1 ± x\ ± x 2 \ and by proposition [l] the 
underlying change of variables also implies that Z is diffcomorphic to a line. So Z is smooth and 
theorem ^ tells us that Z has no more than 3 inflection points and 1 vertical tangent. So we now 
need only check how many intersections Z will have with the line {x | 1 ± x\ ± X2 = 0}. By lemma 
||, we are done. ■ 

It turns out that inflection points for m-nomial curves are easy to describe in a m-nomial way. 
Let 

Lemma 9. Suppose f : — ► R is analytic and Z is the real zero set of f. Then z is an inflection 
point or a singular point of Z => f(z) = and [Off - (9 2 f) 2 — 28182 f ■ dif ■ 82 f + 8 2 f ■ (8if) 2 ] x=z = 0. 
In particular, in the case / := 1 + c\x ai + • • • + c m x am , the preceding polynomial in derivatives is, up 
to a multiple which is a monomial in the x ai , a cubic polynomial homogeneous in the CiX ai . 

Proof: In the case of a singular point, the first assertion is trivial. Assuming 82 f '^0 at an inflection 
point then a straightforward computation of 8\x2 (via implicit differentation and the chain rule) 
proves the first assertion. If B%f — at an inflection point then we must have 8\f 7^ 0. So by 
computing 8 2 x± instead, we arrive at the remaining case of the first assertion. The second assertion 
also follows routinely. ■ 

Let us now reveal the hardest case of our result for pairs of trinomials. First note that while one 
can naturally associate a pair of polygons to F when n = 2, we can also associate a single polygon by 
forming the Minkowski sum Pp := Newt ( ) + Newt (/ 2 ). We can then give the following addendum 
to theorem |l| (with an independent proof). 

Corollary 3. Following the notation of GKC and theorem [J, consider the case (77, mi, 777,2) = 
(2,3,3). Then A/"(3,3) is respectively 0, 2, or 4, according as we restrict to those F with Pp a 
line segment, triangle, or i-gon with £e{4, 5}. 

Proof: The segment case follows immediately from corollary 0. For the remaining cases, proposition 
[j] implies that we can assume /1 := 1 — x\ — X2 and fi := 1 + Ax\x\ + Bxfx^- In particular, it is easily 
verified that the underlying monomial change of variables preserves the postivity of angles between 
lines (in exponent space), so the number of edges of Pp is unchanged. 

Let Si :—Axix\, S2 ■— Bx\x 2 l , and let Z denote the zero set of / 2 . Observe that lemma |9| tells 
us that we can bound the number of inflection points of Z by analyzing the roots of a homogeneous 
polynomial in (S^S^) of degree < 3. So let us now explicitly examine this polynomial in our 
polygonally defined cases. 

Clearly then, the triangle case corresponds to setting a = d > and b = c = 0. We then obtain 
that [x is an inflection point or a singular point of Z] ==> 1 + Si + S2 = and Si + S2 = 0. So 
Z has no inflection points (or singularities). It is also even easier to see that Z has no vertical 
tangents. So by lemma ||, A/"(3, 3) < 2 in this case. To see that equality can hold in this case, simply 
consider F := {x\ + x\ - 25, X\ + x 2 - 7), which has Pp = Conv({(0, 0), (3, 0), (0, 3)}) and root set 
{(3, 4), (4,3)}. 

Similarly, the quadrilateral case corresponds to setting & = c = and a, d>0. We then get the pair 
of equations 1 + Sx + S 2 = and a(d - 1)51 - d(a - 1)S 2 =0, with a,d^ {0,1}. (If {a, d} n {0, 1} ^0 
then F, or a suitable pair of linear combination of F, would be pyramidal and we would be done 
by theorem [|) So Z can have at most 1 inflection point. It is also even easier to see that Z has 
no vertical tangents. So by another application of lemma ^, A/"(3, 3) < 4 in this case. To see that 
equality can hold in this case, simply consider the system {x\ — Zxi + 2,x\~ 3x2 + 2), which has 
P F = Conv({0, (2, 0), (2, 2), (0, 2)}) and root set {(1, 1), (1, 2), (2, 1), (2, 2)}. 

Finally, the pentagonal case corresponds to setting 6 = and a,c,d>0. We then get the pair of 
equations l + Si + S 2 = and a 2 (d-l)S 2 + a(ad-d-2c)SiS2-c(c+d)S 2 1 = 0, with ac(d-l)(c+d)^0. 
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(Similar to the last case, it is easily checked that if the last condition were violated, then we would 
be back in one of our earlier solved cases.) However, a simple check of the discriminant of the 
above quadratic form in (Si,^) shows that there is at most 1 root, counting multiplicities, in any 
fixed quadrant. So, similar to the last case, we obtain jV(3, 3) < 4 in this case. To see that the 
equality can hold in this case, simply consider the system {x\ — 7x2 + 12, —1 + x\X2 — x\), which 
has P F = Conv({O,(2,0),(2,2),(l,3),(0,2)}) and root set {(3, (4, 2 ± V^)}. ■ 

5. Monomial Morse Functions and Connected Components: Proving Theorem ^ 

A construction which will prove quite useful when we count connected components via critical 
points of maps is to find a monomial which is a Morse function relative to a given fewnomial zero 
set. 

Remark 12. In what follows, we will always understand dim (resp. dime,) to mean real (resp. 
complex) dimension. Also, unless otherwise noted, "dimension" will be understood to mean real 
dimension, o. 



Lemma 10. Suppose Z is the zero set in W] of an n-variate m-nomial f . Then there exists a finite 
union of hyperplanes Hz C K n such that for all a € K™ \ Hz we have. . . 

1. Every critical point of the restriction of x a to Z is non-degenerate. 

2. The level set in Z of any regular value of x a has dimension <n — 2. 

3. No connected component of Z (other than an isolated point) is contained in any level set of 
x a . 

4. Every unbounded connected component of Z has unbounded values of x a . 

Proof: Let us prove the last two assertion first: Since the number of connected components of Z is 
finite,]^] we can temporarily assume that Z consists of a single connected component. Then, if we 
could find n linearly independent a with Zc{x£ M™ | x a = c a } for some c a , proposition [l| would 
immediately imply that Z is contained in a point. Similarly, if we could find n linearly independent 
a for which the restriction of x a to Z is bounded, then we would obtain by proposition [l] again that 
Z is bounded — a contradiction. 

To prove the rest of our lemma, let us return to general Z and consider the substitution xi = e Zi . 
A simple derivative computation (noting that x i— ► (e Xl , . . . , e Xn ) is a diffcmorphism between K™ 
and (R*)™) then shows that it suffices to instead prove the analogous statement where / is replaced 
by a real exponential sum (a real analytic function in any event) and x a is replaced by the linear 
form a\Z\ + • • ■ + a n z n . The latter analogue is then nothing more than an application of [BCSS9S, 
lemma 1, pg. 304], combined with Khovanski's Theorem on Fewnomials to ensure that Hz is finite 
instead of countable. ■ 

We will also need the followi ng usef ul perturbation result, which can be derived via a simple 
homotopy argument. (See, e.g., [ Bas99 , lemma 2] for even stronger results of this form in the case 
of integral exponents.) 



Lemma 11. Following the notation of lemma It, let Z$ denote the solution set of |/| < 8 in IR™ 

o o 

and Zs its boundary. Then for 5>0 sufficiently small, Zg and its closure are smooth, and there is a 
bijection between the connected components of Z and Zs which preserves compact and non-compact 
components. I 

Finally, we will need the following two results (the latter dating back to an analogous result of 
Giusti and Heintz [GH93, sec. 3.4.1] in the complex algebraic case, if not earlier) for dealing with 
over-determined fewnomial systems. 

Real Dimension Lemma. Suppose U is an open subset ofW 1 , W is an irreducible real analytic 
subvariety of U , and g : U — ► R is a real analytic function with g(w) ^ for some w <G W . Then 
diml^n {z&U | g(z) = 0}<dimW. 



12 The smoo th case is detailed in [Kho91, sec. 3.14] and the case of integral exponents (allowing degeneracy) is a 
special case of |Roj00a lemma 3.2]. In any event, the proof of the latter lemma extends easily to real exponents. 
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Proof: Let d:=dim W and let Wc be the complexification of W. Then Wc is an irreducible analytic 
subvariety of U' where U' C C" is an open subset containing U and dime Wc > d. Furthermore, 
by JGR84 Active Lemma, pg. 100] we have dim c Wc H {z G U' \ g(z) = 0} = dim c Wc - 1- So, 
VF n {z G [/ | .g(z) = 0} (the real part of Wc O {z £ U' \ g(z) = 0}) must have strictly smaller real 
dimension than W . ■ 

Lemma 12. Suppose k>n and that F:={f\, . . . , fk) is a k x n fewnomial system. Assume further 
that there are at most m distinct exponent vectors in F . Then there exist real numbers aij such that 

1. the real zero set of G.= (au/i + • • • + a\kfk, ■ ■ ■ , »ni/i + • • • + a n kfk) "is the union of the real 
zero set of (/i, . . . , fk) and a finite (possibly empty) set of points. 

2. G is of type (m — 1, . . . , m — 1) cmd /ias no more i/ian m distinct exponent vectors. 

Proof: Let us first make the substitution Xi = e Zi , noting that m (e Xl , . . . , e x ™ ) is a diffeomorphism 
between R" and R" which preserves the dimension of the underlying subanalytic varieties. Now 
pick an, . . . , aifc G R so that au/i + • • ■ + aifcA- is not identically zero. Fix a set of points {wi}, 
one lying in each irreducible component of the zero set Z\ of a\\f\ + • ■ • + a\kfk in R n - Let us then 
pick d2i, . • • , a2k so that 021/1 + ■ ■ ■ + o-2kfk does not vanish at any {uii}. By the Real Dimension 
Lemma we then obtain that the zero set Z2 of (on/i + • • • + difc/fc, 021/1 + • • • + Q>2kfk) in R™ is the 
union of a diffcomorphic copy of Z and a real analytic variety of dimension n — 2. Continuing this 
construction inductively, and then changing variables back again, we easily obtain assertion (1). 

An application of Gaussian elimination to eliminate one monomial from each of the polynomials 
of G then gives us assertion (2). ■ 

To finally prove theorem |2j, let us make one last definition. 

Definition 6. Letting f be a bivariate m-nomial and Z the zero set of f in the positive orthant, 
define... 



m) 
rn) 
m) 



The maximal number of isolated singular points of such a Z . 
The maximal number of isolatea^ inflection points of such a Z . 
The maximal number of isolatea^ points of vertical tangency of Z. 

Theorem @ is then an immediate corollary of theorem H below. 



Definition 7. Let JC(n, n) be the maximal number of isolated roots in R™ of an n x n fewnomial 
system with exactly fi distinct exponent vectors. (So JC'(n, /i) <IC(n, fi).) o 

Theorem 4. Theorem^ is true. Furthermore, defining fC(n, 0) :— and following the notation of 
definition ^ we also have the following inequalities: 

4. S(m),V(m)<]C(n,m) 

5. S(m) + I(m)< 3/C'(n, m) for m<3 

Proof: Let us focus first on proving theorem E| To prove assertions (1) and (2), note that we can 
divide by a suitable monomial so that / has a nonzero constant term. By lemma fl], we have that 
for S > sufficiently small, it suffices to bound the number of compact and non-compact connected 

o 

components of Zg (a "thickening" of Z). In particular, Zs, the boundary of Z$, and its closure, 

o 

can be assumed to be smooth. Noting that every connected component of Zs is contained in some 

o 

connected component of Zs, it then suffices to bound the number of connected components of Zs- 
By proposition |l| and lemma [l0[ we can pick annxn matrix A so that, after we make the change 

A ° 

of variables x 1— > x , the number of compact and non-compact real connected components of Zs is 

o 

preserved and no connected component of Zs of positive dimension is contained in a hyper-planc 
parallel to the xi-coordinate hyperplane. Furthermore, we can also assume that every non-compact 

o 

component of Zs has unbounded values of x\. So we are now ready to use critical points to count 
connected components. 

13 Relative to the locus of inflection points. 

14 Relative to the locus of points of vertical tangency. 
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Consider then the system of equations G± :— (/ ± 6^x282/, ■ ■ ■ ,x n d n f), where di denotes the 

o 

operator By construction, every compact connected component of Z$ results in at least two 
extrema of the function x\, i.e., P C omp(",™) is bounded above by an integer no more than half of 
the total number of roots of G+ and G_ . (In particular, if Z were smooth to begin with, then 
it would suffice to count the isolated roots of G := (/, x-^dif, • ■ ■ , x n d n f) instead and omit the use 
of Z$ and G±.) Note also that by construction, all the roots of G± (or G) are non-degenerate. 
Furthermore, by a simple application of Gaussian Elimination, we obtain that G± (or G) is of type 
(m — 1, . . . , m — 1) (and there are no more than m distinct monomial terms occuring in G± or G), so 

v « ' 

n 

assertion (1) follows immediately. (The bound for P CO mp(l, Tn) follows immediately from UGDRS.) 

To prove assertion (2) of theorem^, another application of lemma [To| (and our much used proposi- 
tion |l|) tells us that we can assume that every unbounded connected component of Z has arbitrarily 
large values of x\ . For e > sufficiently small, we then observe that every such component induces at 
least one connected component of the intersection Z' :=Z P\{x | x\ — -}. So fix an e>0 sufficiently 
small so that this holds for all unbounded components. (Recall that there are only finite many, cf. 
the proof of lemma |To|.) Then, by substituting x\ — ~ into /, we obtain a new fewnomial hypersurface 
Z"CM n ~ , also defined by an m-nomial, with at least as many connected components as Z has un- 
bounded components. To conclude, note that under the change of variables x 1— ► [x^ x , ... , x" 1 ), the 
bounded non-compact components of Z are injectively embedded into the unbounded components 
of a new m-nomial hypersurface. So by what we've already proved for our unbounded components, 
we at last obtain P ncm (n, m) < 2(P coulp {n — 1, to) + P n0 n{n — 1, to)). 

The bound for P non (2,m) then follows from the now classical moment map. That is, given any 
n-dimensional convex compact polytope Pel", there is a real analytic diffeomorphism rp : M™ — > 
Int(P), where Int(P) denotes the interior of P [ Ful93| , sec. 4.2]. In particular, if one picks P to 



be the Newton polygon of / then there is a bijection between (a) the intersections of ip(Z) with 
the interior of an edge of P with inner normal w, and (b) the roots of the initial term polynomial 
m iij(/) '=^2c a x a in (R*) x {1}, where the sum is over all aGSupp(/) with minimal inner product 
with w. Since any non-compact component U of Z results in tp(U) having at least 2 intersections 
with the edges of P, UGDRS immediately implies our bound for P non (2,m), not to mention our 
bound for P non (l, to). (In fact, in our bound for P non (2, to), we can even replace to by the number 
of monomials correspondingto points on the boundary of P.) 

Assertion (3) of theorem |2| follows immediately from assertion (4), which we will now prove. First 
note that the singular points of Z are exactly the roots of the over-determined fewnomial system 
F := (/, xidif, a^c^/)- By lemma [l2] the singular points of Z are also contained in the roots of the 
system G :— (<?i, 172), where G is of type (to— 1, m— 1), has no more than m distinct exponent vectors, 
and each gi is a suitable linear combination of /, x\d\f , and Xi d%f . Furthermore, the real zero set 
of G is the union of the real zero set of F and a (possibly empty) finite set of points. This proves the 
bound on S(m), and the bound on V(m) is proved in almost exactly the same way, starting with 
the polynomial system (f,X2d2f) instead. So assertion (4) is proved. 

To prove assertion (5), note that by lemmata [l] and |^, {x\, X2) is an inflection point or a singular 
point of Z => f = q=Q, where q is a homogeneous polynomial, in the non-constant monomials 
terms of /, of degree at most 3. Letting Si, . . . , S m - 1 denote the non-constant monomials terms 
of /, note that each complex factor q' := aiSi + ■ • • + a m -iSm-i of q is a j-nomial for some 
j <m — 1. (Note that the fundamental theorem of algebra tells us that q indeed splits completely 
over C[5i, . . . , S m -i], provided m < 3.) Also note that if on 7^ 0, the fewnomial systems (1 + S\ + 
■ • • + S m -i, q') and G := (1 + S\ + ■ ■ ■ + S m -i — q 1 /on, q') have the same zero set, and on must be 
nonzero for some i. However, G is of type (to — 1, m — 1), has no more than to distinct exponent 
vectors, and has no degenerate roots. So the system (/, q) has at most 3Af'(m — 1, to — 1) isolated 
roots in the positive quadrant of the (x±, X2)-plane. So assertion (5) is proved. ■ 



Remark 13. The equality tC'(n,rn) = }C(n,m) appears to be known only for (n, to) £ (1 x N) U 
{(2, 2), (2,3), (2, 4)} and TO=n+l. These few cases follow easily from theorems |I] an<i[3|, via remark^, o 
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